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Abstract. In Boolean algebra, it is known that the logical function that 
corresponds to the negation of the conjunction -NAND- is universal in 
the sense that any other logical function can be built based on it. This 
property makes it essential to modern digital electronics and computer 
processor design. Here, we design a molecular Turing machine that com¬ 
putes the NAND function over binary strings of arbitrary length. For 
this purpose, we will perform a mathematical abstraction of the kind of 
operations that can be done over a double-stranded DNA molecule, as 
well as presenting a molecular encoding of the input symbols for such a 
machine. 
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1 Introduction 

Moore’s Law is the empirical observation that the number of transistors on in¬ 
tegrated circuits doubles approximately every two years. However, it is known 
that the continuous miniaturization of electronic devices has undesirable con¬ 
sequences: fitting more and more chips in an area that is becoming smaller 
according to the standards of the market results in an uncontrollable increase 
of heat when the device begins its operation [?]. Hardware manufacturers are 
reaching the physical limit when attempting to develop more powerful devices 
in the the least space possible. Moreover, the holy grail of digital storage is be¬ 
ing able to store a bit per atom, which implies massive storage in a very small 
volume. However, theory says that in order to achieve such an endeavour, one 
would have to pay the quantum price for dealing with subatomic particles. 

An alternative to electronic and quantum computing is molecular computing, 
which also attempts to reduce the scale of computation and at the same time in¬ 
crease its power. Molecular computing uses as hardware (rather wetware) organic 
molecules such as ribonucleic acid (RNA) and deoxyribonucleic acid (DNA). In 
this work, we focus on the latter, which is known elsewhere as DNA computing. 

There are two main reasons that make DNA an attractive computing medium, 
one is parallelism and the other, complementarity. The density of information 
encoded in a DNA strand, plus its relatively ease to generate copies of it (via 
techniques such as PCR [?]), offers the possibility of realising parallel computing 



massively. For example, Leonard Adleman, one of the pioneers of this comput¬ 
ing paradigm, presented in [?] a solution to the Hamiltonian Path Problem by 
probing in parallel different solutions to this NP-complete problem. On the other 
hand, complementarity refers to the fact that a strand of DNA is composed by 
two single strands of nucleotides that bind naturally through their bases. There 
are four of such bases: guanine (G), adenine (A), thymine (T), and cytosine 
(C). Given their chemical composition, nucleotides with base G bind naturally 
to nucleotides with base C, this fact makes them complementary bases, whereas 
nucleotides with base A bind to those with base T. As mentioned before, this 
complementarity is given free by nature and it can be exploited to model and 
encode a particular task. 

Representing a double strand of DNA as two single strands, one above the 
other taking into account the complementarity of their bases, is a big simplifi¬ 
cation when actually both strands are coupled forming the well-known double 
helix. Our representation of such strings in terms of a strand that extends itself 
horizontally will help us through our exposition in the following pages. However, 
we should also mention that in this work we adopt the convention of express¬ 
ing the direction of a single string of DNA as going from the 5'—end to the 
3'—end, which is given by the nature of its chemical components and that is 
used extensively in biochemistry [?]. 

DNA is one of the main computing media in molecular computation, and it is 
crucial in the processes inside the living cells of organisms. Its two main functions 
are the codihcation for the creation of new proteins, and self-replication, such 
that an exact copy of itself is inherited to cellular offspring. As a computing 
medium it has been used to solve the Hamiltonian Path Problem and other 
combinatorial problems [?], as well as being used as a mechanism to detect the 
presence of molecules that signal the emergence of a chronic disease [?][?]. In 
[?] we present the design of an enzyme-free molecular machine that detects the 
emergence of hepatic fibrosis, and upon detection releases a molecule to interfere 
with the over expressed gene. 

In this paper, our motivation comes from the work of Vineet Gupta et al. 
[?] where the authors present a technique to simulate logic and arithmetic func¬ 
tions through DNA molecules. However, the issue that arises in such work is 
that the method suggested lacks proper automation, that is, depends greatly in 
the presence of human operation, which contradicts the essence of computation, 
namely, the automation of processes that would imply a big effort, cost, and 
time in a situation in which those resources are limited. Our suggestion, thus, 
is the design of a DNA machine, which mimics the behaviour a particular log¬ 
ical function autonomously, that is, minimising the intervention of an external 
operator. The logic function that we will simulate molecularly is the inverted 
AND gate, also known as NAND gate, which is functionally complete. This fact 
implies that any other logic function can be expressed as a combination of gates 
of this type. Hence, its importance in the design of digital electronics, such as 
computer processors. 



We make use of a set of operations that can be performed on a DNA strand. 
This set can be regarded as a gift from biochemistry to this novel computing 
paradigm. The kind of operations that can be performed on a DNA strand 
are: duplication, concatenation, cleavage, extension, length measurement, among 
others. For the type of problem that we address here, we need only a small set of 
operations, which we briefly describe below. For a more exhaustive description 
we refer the reader to [?]. 


— Measuring a DNA strand’s length. The length of a double strand of DNA is 
given by the number of base pairs (a nucleotide and its complement) and is 
denoted by the unit bps. To measure this length a technique known as gel 
electrophoresis is employed. 

— Concatenation and separation of a DNA strand. DNA strands are joined 
by a covalent bond horizontally (that is, when nucleotides are adjacent in 
a single strand), and by a hydrogen bond vertically (that is, when bases 
are complementary). The latter is weaker than the former, so it is possible 
to disjoin two complementary strings of DNA leaving the nucleotides in 
each single string intact just by increasing temperature. If the solution is 
cooled down again, the single strings are susceptible of joining again. This 
process must be performed slowly so strings have a chance to recognise their 
complementary bases. Then, a type of enzyme known as ligase is used, which 
joins covalently two consecutive nucleotides [?]. 

— Cutting a DNA strand. A double-stranded DNA molecule can be cleaved 
by the action of enzymes. Their action can be performed at the ends of a 
molecule or within it. The latter type of enzymes is known as endonucle¬ 
ases, or restriction enzymes. These are found in bacteria and they provide 
their hosts with a defense mechanism against viruses. These enzymes acquire 
their name from the organism from which they were obtained. For example, 
enzyme EcoRI receives its name because it was the first enzyme to be identi¬ 
fied (hence, /) in the bacteria E. coli strain R. Restriction enzymes work by 
identifying a particular DNA string and then cleaving the molecule either at 
this recognised site or a few nucleotides away from it. In this work, we use 
this latter type of enzymes, which are known as restriction enzymes type IIS 
? 


With these ideas in mind, we present our model in the following pages. The 
structure of this paper is as follows, in the next section we present the mathe¬ 
matical version of the Turing machine that computes the logical NAND function. 
In section |2.1[ we present the molecular version of this Turing machine, and we 
present the main result of our work, which is in the shape of a theorem sum¬ 
marising the mathematical abstraction of the elements comprising the molecular 
machine. Finally, in section]^ we discuss some of the implications of our model 
along with some conclusions. 


2 Model 


Broadly speaking, a Turing Machine (TM) is a mathematical object that mimics 
the action of a machine consisting of a head that reads and writes on a tape 
obeying a fixed set of rules. Although simple in its design, a Turing machine 
is a powerful computing device as it can be adapted to mimic the logic of any 
computer algorithm. 

In theory, the tape of the machine has infinite length and its head moves 
one square at a time along it according to a program specified as a set of rules. 
Formally, a TM is defined as follows: 

Definition 1. A Turing Machine is the 7—tuple {S, S, j3, S*,T, sg, A), which 
consists of: 

1. A finite set of states S. 

2. A finite set of symbols (or alphabet) S. 

3. A symbol j3 G S representing a blank character. 

4- A finite set E* C S \ {/?} of input symbols. 

5. A transition function T :SxE^SxEx {L, R}, where L (R) denotes a 
single left (right) movement of the head on the tape. 

6. An initial state Sq G S. 

7. A finite set of accepting states A G S, for which the machine halts and the 
computation is over. 

As mentioned before, the logic function that we implement on a TM is an 
inverted AND gate, whose truth table is shown in table Our TM reads two 
binary strings of length n from its tape, and then computes the NAND function 
entry by entry following the rules specified by its truth table. As a result, the 
machine writes a binary string of length n on its tape with the result of the 
computation. 


Table 1. NAND truth table 


p 

q 

p NAND q 

0 

0 

1 

0 

1 

1 

1 

0 

1 

1 

1 

0 


The first convention that we will adopt concerns the way in which we write 
the input strings on the machine’s tape. Let A = (oi, 02 ,..., a„) and B = 
(5i, 62 ,..., bn) be two binary strings of length n. We write the strings A and B 
intercalating each element of their entries on the tape. That is: 






«! 

bi 

02 

^2 





With this in mind, we define our TM in the following way. Our alphabet is 
the set S = {0,1, e, /3}, where 0 and 1 are the characters comprising our binary 
input and output strings, e is a symbol that will serve as an error detection 
mechanism at the end of the computation, and /3 is our blank character. Thus, 
S* = {0,1}. We consider the set of states S = {Sq,Si,S2tHALT}, where 
HALT S A represents the accepting state which stops the computation, and Sq 
is the initial state of the machine. Finally, the transition rules of our machine, 
and which ultimately mimic the behaviour of a NAND gate, are the following: 


{So,0)^ {Si, {3,^) 

(1) 

{So,l)^ {S2,13, 

(2) 

{So, (3) ^ {HALT) 

(3) 

{Si,0) ^ {So,l,^) 

(4) 

{Si,l) ^ {So,l,^) 

(5) 

{Si,l3)e^ {So,e,^) 

(6) 

{S2,0) i-t (^o,!,^) 

(7) 

{S 2 ,l) ^ {So,0,^) 

(8) 

{S2,I3) ^ {So,e,^) 

(9) 


Each of the transitions are interpreted in the following way: 

{current.state, character.being.read) i—>■ {new.state, characier.to.write, move.head.to) 

In the following section we will implement all the elements of this TM into 
DNA molecules and we will provide a mathematical proof of its operation. 

2.1 Molecular Turing Machine 

In this section we present the molecular version of the TM defined previously. We 
perform a molecular encoding of all the elements of the TM recently described, 
which means that we will represent the alphabet, the transitions and states as 
DNA strands. In summary, a circular double-stranded DNA molecule of DNA 
will act as the tape of the TM, in which a substring will simulate the action of 
the head. Using ideas from [?], we use restriction enzymes to design the hardware 
of our machine. As mentioned previously, we use type IIS restriction enzymes to 
cut DNA strands at a defined distance from their non-palindromic asymmetric 
recognition sites [?], which in other words means that the recognition and the 
cleavage sites are away from each other. 

In table we describe the five restriction enzymes that we employ, their 
recognition site, the distance between it and the place the cut is performed. 












and the direction in which they perform such cut. The nucleotide Ni denotes 
an arbitrary base. We must point out that these enzymes cut from right to left, 
or vice versa, whenever they find their recognition sites in the top string (in 
direction 5' to 3') or its mirrored version in the bottom string (in direction 3' 
to 5'). However, we only specify the direction of cleavage in the way we require 
for the purposes of our machine. After cleavage these enzymes leave a sticky-end 
in the strings that have been cut. This sticky-end is an overhang of nucleotides 
in which another molecule can be joined. We group these five enzymes in a 
mathematical set that we name <P. 


Table 2. Restriction Enzymes 


Name 

Direction 

Recognition Site 

Restriction Site 

FokI 


5'-GGATG-3' 

3'-CCTAC-5' 

5' - . . . Ag 

3^ — ... AgAioAii A 12 A 13 

BsrDI 


5'-GGAATG-3' 

3'-GGTTAG-5' 

5' - A 1 A 2 

3' - ... 

Bpml 


5'-GTGGAG-3' 

3'-GACGTC-5' 

5^ — ... A 14 A 15 Ai 6 

3' - ... Ai4 

Bpml 


5'-CTGCAG-3' 

3'-GAGGTG-5' 

Ai4 ... - 3' 
A 16 A 15 A 14 ... - 5' 

Bserl 


5'-CTCCTC-3' 

3'-GAGGAG-5' 

As ... - 3' 
AioAgAs ... - 5' 

Bbvl 

- 

5'-GCTGC-3' 

3'-CGACG-5' 

Ai 2 All AioAgAs ... — 3^ 
As ... - 5' 


To simplify our exposition we adopt the following conventions regarding the 
representation of DNA strands, in particular, the representation of the recogni¬ 
tion sites and cleavage direction of the restriction enzymes. 

Notation 1. A DNA strand (that is not linked by its extremes) will be repre¬ 
sented as a string of characters enclosed by square brackets. 


Whereas, a DNA strand that is linked by its extremes will be represented as a 
string of characters enclosed by simple parentheses. 


( 


Notation 2. The string denotes the recognition site of restriction enzyme 

RS G whose cleavage direction is from left to right. Analogously, the string 
denotes the same enzyme, but this time its cleavage is from right to left. 




We will use a word size of 6 bps to encode the symbols in the set S, plus 
a suffix of 4 bps, which serves as character delimiter in the molecular tape. We 
do not give an explicit declaration of the bases comprising the strands from the 
alphabet; as long as they do not contain any of the recognition sites used in the 
set the choice of their bases is arbitrary. Thus, our symbols, plus the suffix, 
look like: 


A1A2A3A4A5AQ . F1F2F5F4 


0= I(loi 

A1A2A2A4A5AQ F1F2F2F4 

^ BiB2B3B4B^Bq .F1F2F3F4 fll) 

BIB2B3B4BQBQ FIF2F3F4 

P ^1^2/33^4/35/36 I FiF 2 F^F 4 (12) 

^^ 1 ^ 2031^40506 F1F2F3F4 _ 

61828^84656^ .FiF2F^F4~\ 

8= - I - (13) 

_£i£ 2 ^ 3 ^ 4 ^ 5^6 F1F2F3F4 _ 

Moreover, during computation time the current state of the TM is decoded 
in the sticky-end of the last cleaved molecule. Let us consider the following DNA 
strand: 

^ 7V2 A/'s iV4 ATg iVe . i^iA2-^3-^4 

1. The following situation reports that the TM is in state Sq with input N. 

^ Ni N2 A/'s N4 A/'s Nq . Fi F2 F2 F4 

A/'s Nq Fi F2 Fq F4 

2. The following situation reports that the TM is in state S*! with input N. 

^ Ni A/'s A/'s N4 A/'s Nq . Fi F2 Fq F4 

Nq Fi F2 Fq F4 

3. The following situation reports that the TM is in state S 2 with input N. 

Ni N2 A/3 N4 Nq Nq . Fi F2 Fq F4 


N = . 

Fi F2 Fq F4 _ 

We further extend our notation in order to make this exposition clearer. 

Notation 3. The string [Xy] where X is a natural number, and Y G E, rep¬ 
resents X number of nucleotides corresponding to symbol Y. Whereas the string 
[A] represents X number of arbitrary nucleotides. 

For example, the string [6i|4i?] is the simplified version of the DNA strand 


(11) described above. With this in mind we are in position to present the molec¬ 


ular version of the transition rules that were described in the previous section. 

Ti ■.<So,Q>^<Si,p,^> 

BsrD\\'lF\&p\Ap\Q\f3serI\Fok\\A\'lp\12\f3pmI\Bpm\\3\QQ\Q\l3bvI 


( 14 ) 













































Where the string 


[HALT] (23) 

denotes the double-stranded DNA molecule that acts as detection molecule 
for the TM, and indicates that the computation is over. The choice of base pairs 
that comprise this molecule is arbitrary as long as it does not interfere with the 
action of the restriction enzymes. As done previously, we group these molecules 
in a mathematical set that we name T. 

The computation begins when the circular double-stranded DNA molecule, 
in which we encode the input string, is mixed with all the transition molecules 
and the restriction enzymes. The head of the machine is represented by the 
recognition sites of restriction enzymes FokI and Bserl. 



The computation takes place immediately when the restriction enzymes react 
to the contents of the circular double-stranded DNA molecule, and the transition 
molecules bind to it via their sticky-ends. At the end of the computation, the 
contents of the tape can be read on the DNA strand, and this corresponds to 
the output of the TM. 

After having described the contents of the sets F (transition molecules), F 
(restriction enzymes) and E (alphabet molecules) we are able to announce our 
main theorem. 


Theorem 1. < S > is a molecular Turing machine that computes a logical 
NAND function. 


The proof of this theorem can be found in the Appendix section of this 
paper, which basically proves that our design of transition molecules, alphabet 
and restriction enzymes achieves to mimic the action of a logical NAND gate on 
a molecular TM. Thus, showing that it is possible to build a molecular version 
of a logic unit, which together with an arithmetic unit, is an essential part of a 
central processing unit of any computing device. 

Figure 0 in page | 


20 shows an schematic example of the TM operation 


with input 01, where we show how enzymes cleave the molecule, and how the 
transition molecules bind to it mimicking the action of the head reading/writing 
on the tape and then moving along it. For input 01 we expect the machine to 
return 1 as output (see table 0 above), which is the case for our design. 

The action of our TM can be summarised in the following steps, naturally we 
are assuming ideal conditions, which means that our molecules are not affected 
by other substances in the medium and computation is carried out flawlessly: 


1. Enzymes FokI and Bserl recognise their binding sites and cleave the molecule 
accordingly. Because the molecule is a circular DNA strand, no nucleotide is 
lost after cleavage. 

2. Transition molecules enter the medium after being cleaved by enzyme BbvI. 

3. In the medium, enzyme BsrDI acts upon the transition molecules, which 
makes them susceptible to bind to the main strand. 

4. The enzyme ligase aids the coupling of transition molecules and the main 
strand in virtue of their complementary sticky-ends. 

5. To delete the character that has been read from the tape, enzyme Bpml acts 
upon the DNA strand, which cleaves that particular section of the string. 

6. Once more, enzyme ligase bind together the complementary sticky-ends, 
which results in a complete circular double-stranded DNA molecule. 

7. The first step is repeated until no recognition site is found within the main 
molecule, which implies that the string representing the HALT state is con¬ 
tained in the molecule. 


3 Discussion 

An important factor that must be taken in consideration when implementing this 
kind of molecular devices in the lab is the sequence of steps in which computation 



takes place. In our mathematical abstraction, these steps occur one at a time and 
without errors on a unique DNA molecule. In reality, we would have thousands 
of these molecules, plus restriction enzymes and other molecules required for the 
computation (e.g. ligase and ATP) in a single test tube, all of them influencing 
each other in one way or the other. Therefore, we would expect that only a 
percentage of those strands will perform the computation as planned. 

Should this fact discourage us? We should keep in mind that the molecular 
computing paradigm is still in its infancy, and a lot of work must be done both in 
the theoretical and in the experimental fronts. Research efforts must be directed 
towards developing mechanisms for error detection and fault tolerance. In [?], 
the author presents a series of strategies to deal with undesired effects during 
molecular runtime. Examples of these adverse effects are: faulty or incorrect 
bonds, and faulty or incorrect cuts. Strategies to deal with them include the use 
of exonucleases and endonucleases with no specific restriction site to detect and 
correct errors during run-time [?]. 

Genetic material is a promising computing medium due to its relative ease of 
parallelisation and its natural tendency to bind to complementary strings. This 
features offer the possibility of solving hard computing problems with less effort 
and cost than it would require to conventional computation. The cost of building 
and maintaing a computer cluster is very high in terms of space, energy and other 
human and technical resources. On the other hand, molecular computing offers 
the possibility of having thousands of molecules computing in a space as small 
as a teaspoon, without any negative environmental impact. 

Nevertheless, it is very likely that up to now this computer paradigm might 
not perform any better than our modern computers. However, we should take 
a few steps back and think about the nature of this new computing paradigm. 
Genetic material as a computing device opens a new perspective on the kind 
of problems that we could face with it. DNA is the native language of the cell, 
therefore using the former as software and the latter as hardware gives us a new 
opportunity to deal with medical conditions that remain intractable to date. 
One of the first steps towards this direction is the construction of an arithmetic 
and logic unit made entirely from DNA molecules. Here we offered an insight for 
such a purpose. 

Appendix 

Theorem 1. < r,<P, S > is a molecular Turing machine that computes a logical 
NAND function. 

Proof. We proceed in the following way. Let us consider a string as long as the 
proof requires, that is, a circular double-stranded DNA molecule with undefined 
length. Thus, we have the following initial configuration: 

( 6 / 3 | 4 i?| 6 |i 3 seH/|Fofc)| 9 | 6 Ai | 4 f| 6 a 2 | 4 f| • • • | 6 a. | 4 f| • • •) ( 24 ) 

At this point, the head of the machine is placed over the character Ai. The 
next step occurs when enzymes FokI and Bserl recognise their sites and cleave 



the molecule. This action leaves the molecule in the following configuration: 





|4f|6a2|4f| 


( 25 ) 


The sticky-end A11A12A13A14 indicates that the machine is currently in state 
So with input Ai. Because the molecule is circular, that is, it is linked by its 
extremes, the computation will proceed and a molecule has a chance to fill the 
gap caused by the action of the enzymes, provided it has a complementary 
sticky-end. Let us analyse the different cases for the value of character Ai. 


CASE 1: < So, Xi = P > Then, string (|25[) is actually the string: 


^ I ^ 1-^2 I I /3i/32/53/34/35/36 I |^ | 

6/3|-1 l““—T^K fISasKfI 

05^6 


(26) 


And thus, we expect transition T3 (16) to take place. The molecule repre¬ 


senting such transition gets close to the empty space in string (26) as shown 
below: 






5;75^|4f| halt |6/3|6|fer/| 


(27) 


However, this molecule cannot incorporate to the main strand yet, as it does 
not exhibit any sticky-end. This happens after enzymes BsrDI and Bbvl act 
on it, which results in the following string: 



_ 1 HALT I _ 

/3i^2/53/34 


010203040506 


0506 



(28) 


As the sticky-ends in both molecules are complementary, they are susceptible 
of binding together, which results in the following molecule: 


(6;3|4f| alt |6;3|4f|---) 


(29) 


Resulting in a complete molecule with no recognition sites on which an en¬ 
zyme could act. Thus, at this stage the computation ends as expected. 
CASE 2: < So, Ai = 0 > Thus, string (|^ is actually the string: 


6 / 3 I-1 I-ZFzr| 4 F| 6 A 2 | 4 F| 


(30) 


In this case, transition Ti (14) should take place, which at the moment of 


attempting to bind to the main molecule results in: 


6/31- 


A-iA2A^A4A5Aq 


^ 14_p 16^ 14/7161 I 1414p 1121 I 1 8 1 60 1 6 1 1 


( 31 ) 



















After enzymes BsrDI and Bbvl act upon transition molecule Ti they yield 
sticky-ends in both extremes which are susceptible of binding to the main 
molecule, which results in: 



A2A3A4A5A6 

^5^6 


I _ | 6 ^| 4 f | 6 |i; 3 eT-i-|Fofc^j| 4 | 4 f |l 2 |iipmi-|Bpm^j| 8 | _ | 

F 1 F 2 -F 3 -F 4 AiA2A;jA4 

And then, 

(6/3|4F|6;3|4i;’|6|i3ser/|i?ofc^|4|4F|12|^p™j|i3pTn^/|8|6o|4F|6A2 |4 f| ■ • •) 

In the next step, the enzyme Bpml recognises its site and binds 
molecule cleaving it, which results in: 


(32) 

(33) 
to the 


^6/31 4f 16/31 dj’161 fcR} I 141^ I 


I=:=^|6a2|4f|--- 


(34) 


where the sticky-ends are susceptible of binding, which yields the following 
configuration: 


(6/3|4F|6/3|4F|6|fclRl|?^|4|4F|6A2 |4 f| • • •) (35) 

which we consider as a new initial configuration for the machine, as we did 


for the configuration (24). At this point, the head is placed over the character 


A2. We will continue exploring this branch of the computation later in this 
proof (see below). We continue developing the different cases of Ai. 


CASE 3: < ^o, Ai = 1 > So, the string (|25[) is actually the following string: 


n I I I ® 1 ® 2 B 3 -B 4 B 5 -B 6 I . ,r. I a I 

6/?!- 1 I- Trzr|4F|6A2|4F| 


(36) 


At this moment, transition T 2 should take place. Such a molecule (151 at¬ 
tempts to bind to the main molecule which results in the following: 


(esi—I 


I -B1B2B3S4S5B6 I 


I 14 p 16 ^ 14 i? 161 I1314^7’1121 I 18161161 I 

(37) 

After enzymes BsrDI and Bbvl act upon this transition molecule, we obtain 
the following pair of strings: 


6/3 


I -®l®2^3-®4-®5®6 I 


F 3 F 4 


Fi F 2 F 3 F 4 


|6/3|4F|6|fc 


1|3|4f|12| 


?|8|: 


B1S2S3S4 


( 38 ) 





















The sticky-ends of the last molecule are susceptible of binding together, 
which results in the following molecule: 

(6,s|4F|6/3|4F|6|is»e-R/|Fcfc^7|3|4i?|12|ijpm/|Bpm^|8|6i|4i?|6A2|4i;’| • • •) (39) 

Next, enzyme Bpml recognises its binding site and cleaves the molecule 
resulting in: 

(^6^|4F|6/3|4F|6|LeH/|Fofc)|3|-^| I ^ |6 a2 |4f| ■ ■ ■ ^ (40) 

\ ^2 ^3 ^4 / 

The sticky-ends of the last molecule are susceptible of binding together, 
which yields the following configuration: 

(6/3|4F|6/3|4F|6|fcHi|?^|3|4F|6A2 |4 f| • • ■) (41) 

which analogously to CASE 2 is a new configuration for the machine, whose 
development will be studied below. At this moment, the head is reading 
character A2 after writing character /? as expected. 


With our previous exposition, we have exhausted all the possible cases for 
character Ai, we proceed now to explore the branches of the computation that 
resulted from developing the first character. Given that CASE 1 represents a 
state in which the machine has stopped, we analyse the other two remaining 
case and branch accordingly. Now, we explore the CASE 2 branch, that is, 
when Ai = 0, we have as initial configuration the strand: 

(6^|4F|6/3|4F|6|LeH/|Fofc)|4|4F|6A2|4F| ■ ■ •) (42) 


The head is currently over the character A2. When enzymes FokI and Bserl 
act upon this string, they yield the following string: 


(6;3|4f| 6^|^| |^2i2i2i^|4F|6A3|4F|---) (43) 


where the sticky-end A22A23A24A25 reports that the machine is currently in 
state Si with input A2, which was expected from the transition rules after reading 
character 0 on the tape. Let us explore the different cases for A2. 


CASE 2.1: < Si, X 2 = (3 > So, the string M3| is actually the string: 


6/3|4f| 6;3|^| |^^^^^i^|4F|6A3|4F| 
f>6 


(44) 


According to our transition rules, when the machine reads character /? in 
state Si, transition Tq takes place that is, molecule (191 attempts to bind to 
the main molecule, which looks like: 


6/3|4f|6/3|- 




?6 


I sTfn) I 4 f 16£ I 4f 161 fcrR/1 fIT) 1514F1121 feiAl I silAi) 1816^ 171 fen I 


( 45 ) 














Once the enzymes BsrDI and Bbvl have cleaved the transition molecule, the 
string becomes: 




II 


-P’3-^4 


Fi F 2 F 3 F 4 


16e 14/^ 161 I F^fc^ 1514/d 1121 I 181 




^1/^2/33^4/55 


( 46 ) 

Here, the sticky-ends are susceptible of binding together. When this happens, 
the string becomes: 

( 6 / 3 | 4 F| 6 / 3 | 4 i?| 6 e| 4 F| 6 |ij.eii/|Fofc)| 5 | 4 i?| 12 |ijpm/|Bp™)| 8 | 6 / 3 | 4 i;’| 6 A 3 | 4 F| • • •) ( 47 ) 
Then, enzyme Bpml cleaves the molecule, which results in the following: 


6/3 1 4f 1 6/3 1 4f 1 6e I 4f 1 6 1 fcR} 1777) 1 5 1^ I 


-P’1 -P’2 •P’3 •P’4 


HSasKfI ■ ■ 


( 48 ) 


We see sticky-ends that are susceptible of binding together. When this hap¬ 
pens the resulting molecule is: 


(6/3|4F|6/3|4F|6e|4F|6|fcs|F7fc)|5|4F|6A3|4 f| • ■ •) 


(49) 


Up to this point, we know that the head is placed over the character A3 and 
it has written the symbol e on the tape. This string can be considered as a 
new initial configuration for the machine, but before we continue exploring 
its branch, we take a look at the remaining cases. 


CASE 2.2: < S'!, A2 = 0 > So, the string (43) is actually the string: 


6 / 3 | 4 f| 6 / 3 | — I |^^^^^^| 4 f| 6 a 3 | 4 f| 


(50) 


Given the current state of the machine and the character being read, the 
transition T4 must take place, and thus molecule 0 attempts to add to 
the main molecule: 


(6s|4,|6s|SS| 

I sUd) 14 F16114 F161 fclB/1 TTfe) 1514 F1121 feU/1 bTU) 1816 0171 )j77/1 

(51) 

When enzymes BsrDI and Bbvl act upon the transition molecule they pro¬ 
duce the following string: 

(6„|4,|6s|;ci| 

I I 61 |4F|6|^seii7 |Fofc)|5|4F| 12|^p77i/|Bpm)|8| - | 

-P’1 ■P’2 ■P ’3 •P ’4 ^lAl 2 ^ 3 ^ 4'^5 

(52) 



















which has sticky-ends susceptible of binding together. When they do so, the 
resulting molecule is: 

(6/3|4F|6^|4i^|6l|4F|6|i3»eH7|Fofc^|5|4i?|12|i3pm7|Bpm^/|8|6o|4i?|6A3 |4i?| • • •) (53) 
Next, the enzyme Bpml cleaves the molecule, which results in: 

(^6/3|4i?|6/3|4F|6i|4i;’|6|LeR/|j?ofc)|5|-!-^| | ^ |6Aj4f | ■ ■ ■ ^ (54) 

\ ^^1 ^2 ^3 -^4 J 

Again, the molecule has sticky-ends susceptible of binding together. Thus, 
the molecule becomes: 


(6/3|4i^|6/3|4F|6i|4F|6|LeB/|Fpfc)|5|4F|6A3|4F| • • •) 


(55) 


At this point, the head is placed over the character A3 after writing character 
1 on the tape., which is expected after reading characters 00 on the tape. The 
new state is Sq and below we describe what happens if we further elaborate 
this case. 


CASE 2.3: < S'!, A2 = 1 > Then, string (431 is actually the string: 


6^|4^|6;5|^| 


(56) 


In this situation, the machine is in state Si and input A2 = 1, thus transition 


T5 should take place. When the molecule representing this transition (181 
attempts to bind to the main molecule, we obtain: 


I I I I I I Br\ BriB 

(^6^|4f|6;3|^| 

I I 61 |4p’ I 5 |4/P | 12 | ^pm/| Bpm) | 8 | 61 |7|.^i)p/| 

(57) 

Here, enzymes BsrDI and Bbvl recognise their sites and cleave the molecule, 
which results in: 

I _ ^^ | 6 l |4j;’ |6|^seH7 |Fofc)|5|4j?| 12|^pm7|Bpm)|8| - | 

-B1S2S3B4B5 

(58) 

This allows the sticky-ends to bind together, which produces the following 
strand: 

(6/3|4i7|6/3|4F|6l|4F|6|)5s«H7|Fofc)|5|4i7|12|i3pm7|Bpm)|8|6l|4i7|6A3 |4 f| • • •) (59) 

Finally, the enzyme Bpml cleaves the molecules producing: 


^6/314^716/31 4f 1611 4f 161 fclT? 1777) 151I 


l:==^|6A3|4i.|--- 


(60) 



















And after the sticky-ends bind together, we obtain the molecule: 


(6,g|4i^|6/3|4F|6i|4F|6|LeH/|Fofc^/|5|4F|6A3|4F| • • •) (61) 

Which can be considered a new initial configuration for the machine. The 
head is now placed over the character A 3 in state Sq after writing character 
1 on the tape, as expected after reading the string 01 . 


We wonder what would happen if we further developed the recently described 
cases. We answer this question in the following observation. 

Observation 1. Let us take a look at the tapes that result from following the 


three last cases (2.1,2.2,2.3). First thing we observe is that strings (55) and 
(61) are the same. Moreover, strings (49), (55) y ^61\) correspond to state Sq 


with input A 3 differing only in the contents of its tape. This strings are similar 


to the initial configuration of the machine, string ( 24 ), and thus, we expect to 


get the same results when branching from it, no matter the current contents of 
the tape. In other words, we would expect the same molecular reactions to occur, 
which has been described above. This is valid when considering Ai = 0. On the 
other hand, if we consider Ai = 1 we can further extend the computing branch 
of the machine. We proceed in this direction below. 

We have the following DNA strand as initial configuration for the machine, 
in which the head is reading character A 2 . 


(6^|4i^|6/3|4i?|6|LeH/|Fcfc)|3|4F|6A2|4F| • • •) 


(62) 


Enzymes FokI and Bserl act upon this molecule, so that we obtain the fol¬ 
lowing: 

(6«|4r|65|i3| (63) 


Here, the sticky-end reports that the machine is currently in state S 2 with 
input A 2 . Once again, we explore the different cases for the value of A 2 . 


CASE 3.1: < S 2 , X 2 = (4 > So, the string ( |63[ ) is actually the string: 

(6p\^F\6p\^\ |^’^^| 4 ^| 6 a 3 | 4 ^|- 


(64) 


Given the input X 2 = /3 and state S 2 we expect transition Tg to take place. 


This transition is given by the molecule (22) and on attempting to bind to 
the main molecule results in: 

(6^|4^|6^|^| 1 "^^ 

I 14^^ I Gg 14^ 161 I 1514^71121 I1816^ 181 I 


( 65 ) 
















After enzymes BsrDI and Bbvl have recognised their sites, the molecule 
becomes: 


(6^14^16^1 —I I — 

I - |6g |4f |6|^3e-Rj|Fofc^|5|4f’|l2|^pTnj|Bpm^|8| _ | 

F 1 F 2 F 3 F 4 ^ 1 ^ 2 / 33 / 34 / 35/33 

( 66 ) 

Which binds to the main molecule producing: 
(6/3|4i/’|6/3|4i;’|6e|4p’|6|i3.eii3|Fo/,)|5|4i?|12|ijpTP/|Bp™)|8|6/3|4i;’|6A3|4F| • • •) (67) 


Next, enzyme Bpml recognises its site and cleaves the molecule, thus pro¬ 
ducing: 


6,81 4f 16,914i? 16e 14i? 161 fclA 1777^ 151 


F3F4 


f^o.F:iFA 


|6a3|4f| 


( 68 ) 


When the complementary sticky-ends of this molecule bind together the 
molecule becomes: 


(6/3|4F|6,8|4F|6e|4F|6|fc7i|77fc^|5|4F|6A3 |4 f| ■ ■ •) 


(69) 


Which is a new initial configuration for the machine, and will be analysed 
later in this proof, meanwhile we know that the head is placed over character 
A 3 and it has just written character £ on the tape as it was expected from 
input 1,5. 


CASE 3.2: < 5 ' 2 , A 2 = 0 > So, string (63) is actually the string: 


( 6 ; 3 | 4 f| 6 ; 3 |^-^| |^-^-^| 4 f| 6 a 3 | 4 f| ■ • • ) 


(70) 


For input A 2 = 0 in state S 2 we expect transition T 7 to occur. Thus, the 
molecule becomes: 




I A3A4A5A6 I 


I b77d^ I 4f 1 61 1 4f 161 fcTfu I f77^ 151 4f 1121 1 bTTI) 1 8 1 60 1 8 1 feTr I 


After enzymes BsrDI and Bbvl have cleaved the molecule, we obtain: 


(71) 


f|6^| —I 


I A^AaAkAg, 


-P’3-^4 


Fi F 2 F 3 F 4 


|6i|4F|6|fc 


;|5|4f|12| 


H8|- 


Ai A2 A3 A4 A5 Ae 


(72) 

Here, the sticky-ends are complementary, which means that they are suscep¬ 
tible of binding together. After this happens we obtain: 

(6/3|4F|6/3|4F|6l|4F|6|i3seH7|Fpfc)|5|4F|12|(3pm7|f3pm)|8|6o|4F|6A3 |4 f| • ■ ■) (73) 





















Next, enzyme Bpml acts upon the string, which becomes: 


6/31 4_f 16/31 1611 161 fcsi I151 


F3F4 




( 74 ) 


Fi F2 F3 F4 

Which becomes the following strand after the sticky-ends bind together: 

(6/3|4i^|6/3|4F|6i|4F|6|fcif|777)|5|4F|6A3|4F| • • •) (75) 

This last strand can be considered as a new initial conhguration for the 
machine. We develop this branch of the computation below. Meanwhile, it 
is enough to say that currently the head is placed over character A3, and it 
has just written the character 1 on the tape. 

CASO 3.3: < 32,^2 = 1 > Then, the string (631 is actually the string: 


6/3|4f|6/3|- 




|4f|6a3|4f| • • • ^ 


(76) 


For this combination of input and state we expect transition Tg. When 
the molecule (211 representing this transition attempts to bind to the main 
molecule, we see: 


f|6/3|^| 


I S3B4B5S 


I 14 ^ 16 014 id 161 IF^ 1514 ^7-1121 I 18161181 fcr? I 

(77) 

This molecule is not yet able to bind to the main molecule. It is only when 
enzymes BsrDI and Bbvl recognise their sites that the transition molecule 
exhibits sticky-ends that are complementary to the main molecule: 


f\6p\ — \ 


I B3B4 


F3F4 


Fi F2 F3 F4 


|6o|4id|6|^seK/|F^fc^|5|4id|l2|^p7n/|Bpm^|8| ' 


B1B2S3S4B5B6 

(78) 

When the complementary sticky-ends bing together, they yield the following 
molecule: 


(6/3|4F|6/3|4F|6o|4F|6|)3seH7|Fofc)|5|4F|l2|i3pm7|i3pm)|8|6l|4F|6A3 |4 f| • • •) (79) 

Next, the enzyme Bpml cleaves the molecule after recognising its site. This 
results in: 

(^6/3|4F|6/3|4F|6o|4F|6|LeRj|j7ofc)|5|-!-^| | ^ |6 a3|4f| • • 4 (80) 

After the complementary sticky-ends bind together, we obtain the following 
molecule: 

(6/3|4F|6/3|4F|6o|4F|6|LeHj|Fofc)|5|4F|6A3|4F| ■ • •) (81) 

This strand can be considered a new initial conhguration. Moreover, we know 
that the head is currently placed over the character A3 and that the character 
0 has just been written recently on the tape. 



















At this point, it is natural to wonder what happens if we further develop the 
CASES recently presented. In the following observation we give an answer to 
this question. 


Observation 2. Let us take a look at the tapes that result from applying CASES 
3.1, 3.2 y 3.3, that is, strings (69), (75) and (81). As noted previously, these 
strings are similar to each other and to the initial configuration (24). Thus, we 
can expect the same behaviour from our TM for the rest of the computation. In 
other words, we have exhausted all the possible behaviours of the machine, thus 
its operation will ever fall in one of the cases described above. 


The above not only proves that our transition molecules work correctly on 
the molecular tape, but also that they write what is expected on the tape. □ 







Fig. 1. Molecular TM working on input 01 






































































